PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cagra.1632s0015.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family Trihelix
Protein Properties Length: 443aa    MW: 50682.1 Da    PI: 6.4127
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cagra.1632s0015.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix85.19e-27114208185
             trihelix   1 rWtkqevlaLiearremeerlrrgk.........lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr.tsessstcp 80 
                          +Wt+++v++Li+a++++++++  ++         +kk++W++vsk+m+erg+++sp+qC++k+++lnkrykk++++ +++ +++++++ +
  Cagra.1632s0015.1.p 114 KWTDKMVKLLITAVSYIGDDSTMDSgsrrkfavlQKKGKWKSVSKVMAERGYHVSPQQCEDKFNDLNKRYKKLNDMLGRGtSCQVVENPA 203
                          7**************8877777654456677888**********************************************6699999988 PP

             trihelix  81 yfdql 85 
                          ++d++
  Cagra.1632s0015.1.p 204 LLDSI 208
                          88765 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF138371.6E-24112235No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 443 aa     Download sequence    Send to blast
MDGNFPQGGV VRGGASSYGG FDLQGSMRVH HPESMNQHRH NPNSRPLHEG LPFTMVTGQT  60
CDHHQNISVA EQHKGEREKN SVSDDDEPSF TEEGGDGHNE ANKSAKGSPW QRVKWTDKMV  120
KLLITAVSYI GDDSTMDSGS RRKFAVLQKK GKWKSVSKVM AERGYHVSPQ QCEDKFNDLN  180
KRYKKLNDML GRGTSCQVVE NPALLDSIGY LNEKEKDDVR KIMSSKHLFY EEMCSYHNGN  240
RLHLPHDLAL QRSLQLALRN RDDHDNEDSR KHQMEDLDDE DHDGEGDEHD EYEEQHYSHG  300
DCRGIHYGGG GLGGGPLKKI RQSHSHEDAD HPSHVNSLEC NKVSLPQMPF SQADVNQGGA  360
ESGRSASMQK QWMESRTLQL EEQKLQIQVE LLELEKQRFR WQRFSKKRDQ ELERMRMENE  420
RMKLENDRMG LELKQRELGV EL*
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAY0458500.0AY045850.1 Arabidopsis thaliana unknown protein (At1g21200) mRNA, complete cds.
GenBankAY0913780.0AY091378.1 Arabidopsis thaliana unknown protein (At1g21200) mRNA, complete cds.
GenBankCP0026840.0CP002684.1 Arabidopsis thaliana chromosome 1 sequence.
GenBankF16F40.0AC036104.3 Sequence of BAC F16F4 from Arabidopsis thaliana chromosome 1, complete sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006307528.10.0hypothetical protein CARUB_v10009151mg
RefseqXP_006307529.10.0hypothetical protein CARUB_v10009151mg
TrEMBLR0GX820.0R0GX82_9BRAS; Uncharacterized protein
STRINGfgenesh2_kg.1__2312__AT1G21200.10.0(Arabidopsis lyrata)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM50422752
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G21200.10.0sequence-specific DNA binding transcription factors